Newest 'normalization scikit-learn' Questions

0votes

1answer

232views

how to choose between data normalization or standadization?

I have been studying about data scaling. Two common methods for it are the StandardScaler and MinMaxScaler. As I understood, StandardScaler expects the data to be normally distributed, but I have seem ...

panchester

231

asked Aug 28, 2021 at 2:46

0votes

1answer

84views

How should a stateless data transformation be applied in regard to train/test split?

I want to apply spatial sign transformation to my data, but unlike other transformations this one is stateless. I am using sklearn and normallly i would first use ...

Mateusz

135

asked Jun 14, 2021 at 13:17

0votes

0answers

168views

Why does normalization improve my decision tree performances?

I have a regression problem for which I have to try several models, so I normalized my data and then tried to use a decision tree regressor (from sklearn.tree) and I noticed very good results (...

Michele

1

asked Feb 1, 2021 at 11:11

2votes

0answers

74views

Normalisation of features extracted from audio files

I am building CNN and SVM models which take in MFCC features as input. The MFCC matrices shape is (13, n). The 13 rows are coefficients and n columns represent n time frames. So each row in the matrix ...

user75252

141

asked Sep 2, 2020 at 11:08

0votes

2answers

48views

Selecting Transforms with sklearn pipelines

So I am currently working on a Data set, and I want to use Pipelines to select the transforms. Here is an example of what I want to do : ...

user98969

asked Aug 21, 2020 at 9:16

1vote

0answers

232views

why does Scikit Learn's Power Transform always transform the data to zero standard deviation?

all of my input features are positive. Whenever I tried to apply PowerTransformer with box-cox method, the lambdas are s.t. the transformed values have zero variance. i.e. the features become ...

Kevin Kim

33

asked May 4, 2020 at 8:50

9votes

2answers

7kviews

How to normalize data without knowing the min and max values?

I have a Lending club dataset from Kaggle; it contains many different columns: there are for example dummy variables, years, amount of the loan...ect I want to normalize the data in the training and ...

Ghassen Ben Hamida

93

asked Mar 25, 2020 at 21:10

2votes

4answers

6kviews

why to use Scaler.fit only on x_train and not on x_test for normalizing value using MinMaxScaler?

while normalising the data everone is saying that we need to fit only on x_train and not on x_test ? why is that we should not fit x_test ? if we should not fit the scaler on x_test then why we need ...

star

1,511

asked Dec 28, 2019 at 20:45

0votes

2answers

305views

SVM, which range to use when normalizing

I am using the SVM classifier from Scikit Learn. I was wondering is there is a know-best-practice when it comes to normalization. I'm using different normalization tecniques, but all my normalized ...

Federico Dorato

111

asked Oct 25, 2019 at 10:40

0votes

1answer

489views

What is the use of fit method in sklearn.preprocessing.Normalizer()?

According to the documentation of fit(self, X[, y]) method of sklearn.preprocessing.Normalizer(), it does nothing and return the estimator unchanged. I understand that if I intend to normalize data I ...

Vishal Poddar

1

asked Oct 10, 2019 at 16:57

1vote

1answer

7kviews

If I have negative and positive numbers for a feature, should MinMaxScaler be -1 to 1?

I have a variable X with values ranging from -150 to 400. All the other variables in my training set are positive so I normalized them to be from 0 to 1, or they’re already binary, or they had a ...

Insu Q

191

asked Sep 27, 2019 at 4:55

2votes

1answer

95views

Is it compulsary to normalize the dataset if doing so can negatively impact a Binary Logistic regression performance?

I am using raw data set with 4 feature variables to do a Binominal Classification using Logistic Regression Algorithm. I made sure that the class counts are balanced. i.e., an equal number of ...

GYSHIDO

133

asked Sep 9, 2019 at 12:55

2votes

0answers

218views

Comparing feature importance in LightGBM + Scikit

I have a model trained using LightGBM (LGBMRegressor), in Python, with scikit-learn. On a weekly basis the model in re-trained, and an updated set of chosen features and associated ...

Ripstein

208

asked Aug 26, 2019 at 7:57

2votes

1answer

1kviews

MinMaxScaler when LSTM predictions fall outside of training range?

I am using MinMaxScaler on my training set and applying the transformations to my test set and inverse_transform to my model’s outputs. If this were, say, a stock prediction problem, my training set ...

Insu Q

191

asked May 27, 2019 at 18:45

22votes

3answers

54kviews

When to use Standard Scaler and when Normalizer?

I understand what Standard Scalar does and what Normalizer does, per the scikit documentation: Normalizer, Standard Scaler. I know when Standard Scaler is applied. But in which scenario is Normalizer ...

Heisenbug

411

asked Feb 20, 2019 at 16:38

Stack Exchange Network

All Questions

how to choose between data normalization or standadization?

How should a stateless data transformation be applied in regard to train/test split?

Why does normalization improve my decision tree performances?

Normalisation of features extracted from audio files

Selecting Transforms with sklearn pipelines

why does Scikit Learn's Power Transform always transform the data to zero standard deviation?

How to normalize data without knowing the min and max values?

why to use Scaler.fit only on x_train and not on x_test for normalizing value using MinMaxScaler?

SVM, which range to use when normalizing

What is the use of fit method in sklearn.preprocessing.Normalizer()?

If I have negative and positive numbers for a feature, should MinMaxScaler be -1 to 1?

Is it compulsary to normalize the dataset if doing so can negatively impact a Binary Logistic regression performance?

Comparing feature importance in LightGBM + Scikit

MinMaxScaler when LSTM predictions fall outside of training range?

When to use Standard Scaler and when Normalizer?

Hot Network Questions

All Questions

Related Tags